Text Analysis and Knowledge Extraction
نویسندگان
چکیده
i. Introduction The study of text understanding and knowlegde extraction has been actively done by many researchers. The authors also studied a method of structured information extraction from texts without a global text analysis. The method is available for a comparatively sbort text such as a patent claim clause and an abstract of a technical paper. This paper describes tile outline of a method of knowledge extraction from a longer text which needs a global tex analysis. The kinds of texts ~e expository texts ~) or explanation texts-'. Expository texts described here mean those which have various hierarchical headings such as a title, a heading of each section and sometimes an abstract. In this deEinJtion, most of texts, including technical papers reports and newspapers, are expository. Texts of this kind disclose the main knowledge in a top-down manner and show not only the location of an attribute value in a text but also severn[ key points of the content. This property of expository texts contrasts with that of novels and stories in which an unexpected development of the plot is preferred. This paper pays attention to such characteristics of expository texts and describes a method of anal yzing texts by referring to information contained in the intersentential relations and the headings of texts and then extracting requested knowledge such as a summary from texts in an efficient way.
منابع مشابه
Presenting a method for extracting structured domain-dependent information from Farsi Web pages
Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...
متن کاملEXTRACTION-BASED TEXT SUMMARIZATION USING FUZZY ANALYSIS
Due to the explosive growth of the world-wide web, automatictext summarization has become an essential tool for web users. In this paperwe present a novel approach for creating text summaries. Using fuzzy logicand word-net, our model extracts the most relevant sentences from an originaldocument. The approach utilizes fuzzy measures and inference on theextracted textual information from the docu...
متن کاملText Mining
“Bag of words” model, acronym extraction, authorship ascription, coordinate matching, data mining, document clustering, document frequency, document retrieval, document similarity metrics, entity extraction, hidden Markov models, hubs and authorities, information extraction, information retrieval, key-phrase assignment, key-phrase extraction, knowledge engineering, language identification, link...
متن کاملارائه رویکردی برای مدیریت و سازماندهی اسناد متنی با استفاده از تجزیهوتحلیل هوشمند متن
Regarding the fact that stored data occupies a large space in organizations and retention systems and information management that has been resulted in gigantic data warehouses, the need for extracting an appropriate model is felt increasingly. Text mining is one of the most significant methods for extracting a useful and appropriate model that helps organizations in achieving their goals throug...
متن کاملInformation Extraction from Interviews to Obtain Tacit Knowledge: A Text Mining Application
One of the most challenging knowledge management tasks is to obtain, summarize, and present tacit knowledge. It is important to develop approaches for creating insightful summaries from the knowledge obtained. In this paper, we present different information extraction methods for summarizing interview transcripts. Manual, semi-automatic, and automatic text analysis are evaluated to transform ta...
متن کامل